SEQUENCE COMPARISON US 5,952,223 Part of 09/990,437 page 9 

; Patent No. 5952223
; GENERAL INFORMATION:
; APPLICANT: Kaplan, Jerry
; APPLICANT: Perou, Charles
; APPLICANT: Moore, Karen
; TITLE OF INVENTION: COMPOSITIONS FOR THE DIAGNOSIS
; TITLE OF INVENTION: AND TREATMENT OF CHEDIAK-HIGASHI SYNDROME
; NUMBER OF SEQUENCES: 32
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/08/822,445
; FILING DATE: 21-MAR-1997
; INFORMATION FOR SEQ ID NO: 2:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 2186 amino acids
; TYPE: amino acid
; TOPOLOGY: unknown
; MOLECULE TYPE: protein
; FRAGMENT TYPE: internal

US-08-822-445-2 

Query Match 16.8%; Score 882; DB 2; Length 2186; 

Best Local Similarity 27.9%; Pred. No. 1.2e-70; 

Matches 295; Conservative 147; Mismatches 338; Indels 276; Gaps 43; 

Qy 5 EMDTYAKSHDLMSGFWNACYDMLMSSGQRRQWERAQSR-RAFQELVLEPAQRRARLEGLR 63 

I I : I I : : : | : : : | : : : : : | : : | : 

Db 1231 EQKKYESEESVSKGSW QKTVNNNQQSLFQRLDFKSKDI SKIAADITQAVSLSQGIE 1286 

Qy 64 YTAVLKQQATQHSM ALLHWGALWRQLAS PCGAWALRDTPI — -PRWKLSSAETYSRM 117 

I : : : : III : I I I II hi I : I 

Db 1287 RKKVIQHIRGMYKVDLSASRHWQECIQQLTHDRAVWY DPI YYPTSWQLDPTEGPNRE 1343 

Qy 118 RLKL VPNHHFDPHLEASALRD-NLGEVPLTPTEEASLPLAVTKEAKVSTPPELL 170 

I : I : I I : III III I I : I I : 

Db 1344 RRRLQRCYLTI PNKYL LRDRQKSEGVLRP PLSYLFEDKTHSSFSST 1389 

Qy 171 QEDQLGEDELAELETPMEAAELDEQREKLVLSAECQLVTVVAVVPGLLEVTTQNVYFY-D 229 

: I : : : : I I : I : I : I : I I I 

Db 1390 VKDKAASESIRVNRRCISVAPSRETAGELLL-GKC GMYFVED 1430 

Qy 230 GSTERVETEEGIG YDFRRPLAQLREVHLRRFNLRRSALELFFIDQANYFLNFPCKV 285 

= : = I I : I I : : : I | | | : | | : | : | : | : | | 
Db 1431 NASDAVESSSLQGELEPASFSWTYEEIKEVHRRWWQLRDNAVEIFLTNGRTLLLAF 1486 

Qy 286 GTTPVSSPSQTPRPQPGPI PPHTQVRNQVYSWLLRLRPPSQGYLSSRSPQ — EMLRASGL 343 

: : I I : I I I I : : I I : I 

Db 1487 DNNKVRDDVY QSILTNNLPNLLEYGNITAL 1516 

Qy 344 TQKWVQREISNFEYLMQLNTIAGRTYNDLSQYPVFPWVLQDYVSPTLDLSNPAVFRDLSK 403 

I I : I : I I I I I II I I I :: I I I I I I I I I : : I I I I I I I I I : : I : : : I : I I I 
Db 1517 TNLWYSGQITNFEYLTHLNKHAGRSFNDLMQYPVFPFILSDYVSETLDLNDPSI YRNLSK 1576 

Qy 404 P I GWN P KHAQLVREKYE S FE DPAGTIDKFHYGTHYSNAAGVMHYLIRVE 453 

III : : I : I II : : I I I : I I I I : I : I : I : I : 

Db 1577 PIAVQYKEKEDRYVDTYKYLEEEYRKGAREDDPMPPVQPYHYGSHYSNSGTVLHFLVRMP 1636 



Qy 454 PFTSLHVQLQSGRFDCSDRQFHSVAAAWQ-ARLESPADVKELIPEFFYFPDFLENQNGFD 512 

III:: I II I I I I I I : : II II I I I I I I I I I I : I I I : I I I 

Db 1637 PFTKMFLAYQDQSFDIPDRTFHSTNTTWRLSSFESMTDVKELIPEFFYLPEFLVNREGFD 1696 

Qy 513 LGCLQLTNEKVGDVVLPPWA-SSPEDFIQQHRQALESEYVSAHLHEWIDLIFGYKQRGPA 571 

I I I : I I I I I I I : I II I I I I I I I : : I I : : I I I I : I M I I : I I 
Db 1697 FGVRQ-NGERVNHVNLPPWARNDPRLFILIHRQALESDHVSQNICHWIDLVFGYKQKGKA 1755 

Qy 572 AEEALNVFYYCTYEGAVDLDHVTDERERKALEGI I SNFGQTPCQLLKEPHPTRLSAE 628 

: : I : I I I : I I I : I : II : I : II I : I : I I I I I I I : I I : 

Db 1756 SVQAINVFHPATYFG-MDVSAVEDPVQRRALETMIKTYGQTPRQLFHTAHASRPGAKLNI 1814 

Qy 629 EAAHRLAR LDTNSPS I FQHLDELKAFFAEVTVSASG 664 

: I I I : II : | | : | I 

Db 1815 EGELPAAVGLLVQFAFRETREPVKEVTHPSPLSWIKGLK — WGEYVGSPSAPVPWCFSQ 1872 

Qy 665 LLGTH SWLPYDRNIS 679 

III: I I I I I 

Db 1873 PHGERFGSLQALPTRAICGLSRNFCLLMTYNKEQGVRSMNNTNIQWSAILSW-GYADNIL 1931 

Qy 680 NYFSFSKDPTMG SHKTQRLLSGPWVPGSGVSGQALAVAPDGKLLFSGGH WD 730 

I : I : | : :: | | | | | | | : | : 

Db 1932 RLKSKQSEPPINFIQSSQQHQVTSCAWV PDSCQLFTGSKCGVITAYT 1978 

Qy 731 GSLRVTALPRGKLLSQLSC--HLDVVTCLALDTCGIY--LISGSRDTTCMVW RLLHQ 783 

I : : : I I : I : : I I : I I : II I I I I I : I I I I : 

Db 1979 NRLTSSTPSEIEMESQMHLYGHTEEITGLCV--CKPYSVMISVSRDGTCIVWDLNRLCY- 2035 

Qy 784 GGLSVGLAPKPVQVLYGHGAAVSCVA ISTELDMAVSGSE DGTVIIHT 830 

I I I 1 I : I : I : hi II I h : I : : I 

Db 2036 VQSLAGHKSPVTAVSASETSGDIATVCDSAGGGSDLRLWTWGDLVGH- 2083 

Qy 831 VRRGQFVAAL RPLGATFPGPIFHLALGSEGQIVVQSSAWE RPG 873 

I : : : : : I I : I : I I I I I I h : I 

Db 2084 VHCREIICSVAFSNQPEGVS INVIAGGLENGIVRLWSTWDLKPVREITFPKSNKPI 2139 

Qy 874 AQVTYSL HLYSVNG KLRASLPL 895 

: I : I I I I : I : I I h 

Db 2140 ISLTFSCDGHHLYTANSEGTVIAWCRKDQQRVKLPM 2175 



RESULT 3 
US-09-396-540-2 

; Sequence 2, Application US/09396540 

; Patent No. 6310182 

; GENERAL INFORMATION: 

; APPLICANT: Kaplan, Jerry 

; APPLICANT: Perou, Charles 

; APPLICANT: Moore, Karen 

; TITLE OF INVENTION: COMPOSITIONS FOR THE DIAGNOSIS 
; TITLE OF INVENTION: AND TREATMENT OF CHEDIAK-HIGASHI SYNDROME 
; NUMBER OF SEQUENCES: 32 
; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Pennie & Edmonds 

; STREET: 1155 Avenue of the Americas 

; CITY: New York 

; STATE: New York 



; Sequence 1, Application US/08822445
; Patent No. 5952223
; GENERAL INFORMATION:
; APPLICANT: Kaplan, Jerry
; APPLICANT: Perou, Charles
; APPLICANT: Moore, Karen
; TITLE OF INVENTION: COMPOSITIONS FOR THE DIAGNOSIS
; TITLE OF INVENTION: AND TREATMENT OF CHEDIAK-HIGASHI SYNDROME
; NUMBER OF SEQUENCES: 32
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/08/822,445
; FILING DATE: 21-MAR-1997
; INFORMATION FOR SEQ ID NO: 1:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 6830 base pairs
; TYPE: nucleic acid
; STRANDEDNESS: single
; TOPOLOGY: linear
; MOLECULE TYPE: DNA
; FEATURE:
; NAME/KEY: Coding Sequence
; LOCATION: 1...6558
; OTHER INFORMATION:

US-08-822-445-1 

Query Match 5.0%; Score 176.8; DB 2; Length 6830; 

Best Local Similarity 52.9%; Pred. No. 5.5e-31; 

Matches 475; Conservative 0; Mismatches 387; Indels 36; Gaps 3; 



Qy 1312 CTCAGGCCTTACCCAGAAATGGGTACAGCGTGAGATATCCAACTTCGAGTACTTGATGCA 1371 

I I III M I Ml I III I I I I I I I I I I I I I I II 

Db 4539 CACCGCTCTGACAAACCTGTGGTATTCTGGACAAATTACCAATTTTGAATATTTGACTCA 4598 

Qy 1372 ACTCAACACCATTGCGGGGCGGACCTACAATGACCTGTCTCAGTACCCTGTGTTCCCCTG 1431 

I I I I I I I I I I II MUM II I M II I II M M M II M 

Db 4599 TTTAAACAAGCATGCGGGCCGGTCCTTCAATGATCTCATGCAGTACCCGGTGTTCCCCTT 4658 

Qy 1432 GGTCCTGCAGGACTACGTGTCCCCAACCCTGGACCTCAGCAACCCAGCCGTCTTCCGGGA 1491 

MM II II I M II I I I I I I M I I I I I I I II I I I 

Db 4659 CATCCTTTCTGACTATGTTAGTGAGACTCTTGACCTCAATGATCCATCTATCTACAGAAA 4718 

Qy 1492 CCTGTCTAAGCCCATCGGTGTGGTGAACCCCAAGCATGCCCAGCTCGTGAGGGAGAAGTA 1551 

M I M I II I M I I I I I I I || I || || M | || 

Db 4719 CCTATCTAAGCCTATAGCTGTGCAGTATAAAGAAAAAGAAGACCGTTACGTTGACACATA 4778 

Qy 1552 TGAAAGCTTTGAG GACCCAGCAGGGACCAT 1581 

I Mill I Mill II 

Db 4779 CAAGTACTTGGAGGAGGAGTATCGCAAGGGAGCTCGAGAGGATGACCCCATGCCTCCTGT 4838 

Qy 1582 TGACAAGTTCCACTATGGCACCCACTACTCCAATGCAGCAGGCGTGATGCACTACCTCAT 1641 

I I I I I I M M I I I I II II II II II I I || | | | M | | | || | 

Db 4839 GCAACCCTACCACTATGGCTCCCACTACTCCAACAGCGGCACCGTGCTCCACTTCCTGGT 4898 

Qy 1642 CCGCGTGGAGCCCTTCACCTCCCTGCACGTCCAGCTGCAAAGTGGCCGCTTTGACTGCTC 1701 

I I M I M I II II II I | M | I I | | | | | 

Db 4899 CAGGATGCCGCCTTTCACTAAAATGTTTCTAGCCTATCAAGATCAGAGTTTCGACATTCC 4958 



Qy 1702 CGACCGGCAGTTCCACTCGGTGGCGGCAGCCTGGCAG GCACGCCTGGAGAGCCCTGC 1758 

I I I I I I I I I I I I I I I I I I I I I M I I | 

Db 4959 AGACCGAACATTTCATTCTACAAACACAACTTGGCGCCTCTCCTCCTTTGAGTCCATGAC 5018 

Qy 1759 CGATGTGAAGGAGCTCATCCCGGAATTCTTCTACTTTCCTGACTTCCTGGAGAACCAGAA 1818 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | I M I I I 

Db 5019 TGATGTGAAGGAGCTGATTCCAGAGTTTTTCTATCTTCCTGAGTTCTTAGTGAACCGTGA 5078 

Qy 1819 CGGTTTTGACCTGGGCTGTCTCCAGCTGACCAACGAGAAGGTAGGCGATGTGGTGCTACC 1878 

II I I I I I I I I I I I I I I I I || | I || 

Db 5079 AGGCTTTGACTTCGGTGTTCGTCAGAATGGAGAGCGGGTTAACCACGTCAATCTTCCTCC 5138 

Qy 1879 CCCGTGGGCCAGCTCTCCTGAGGACTTCATCCAGCAGCACCGCCAGGCTCTGGAGTCGGA 1938 

I I Ml I M I I I I I I I I I I I I I I II I I I I I II I I I I 

Db 5139 CTGGGCACGCAACGATCCTCGGCTGTTCATCCTTATTCACCGGCAAGCACTAGAGTCTGA 5198 

Qy 1939 GTATGTGTCTGCACACCTACACGAGTGGATCGACCTCATCTTTGGCTACAAGCAGCGGGG 1998 

I I I I I M Ml I MINIMI I I II I I II I I II I I I I III 

Db 5199 CCATGTGTCCCAGAACATCTGTCACTGGATCGACTTAGTGTTTGGCTACAAGCAAAAGGG 5258 

Qy 1999 GCCAGCCGCCGAGGAGGCCCTCAATGTCTTCTATTACTGCACCTATGAGGGGGCTGTAGA 2058 

I M II I I I I II I I I I I I I I I I Mill II IN 
Db 5259 GAAGGCGTCTGTTCAAGCCATCAATGTCTTCCACCCTGCTACATATTTTGG AATGGA 5315 

Qy 2059 CCTGGACCATGTGACAGATGAGCGGGAACGGAAGGCTCTGGAGGGCATTATCAGCAACTT 2118 

I II MM II I I II I I I I I I I I I I III 

Db 5316 TGTCTCTGCAGTTGAAGATCCAGTGCAGAGACGGGCTTTAGAAACCATGATAAAAACCTA 5375 

Qy 2119 TGGGCAGACTCCCTGTCAGCTGCTGAAGGAGCCACATCCAACTCGGCTCTCAGCTGAG 2176 

M II I I II II I I I II I I I I I II I I I III Ml II 

Db 5376 CGGGCAGACCCCACGTCAGTTGTTCCACACAGCCCATGCCAGCCGACCTGGAGCCAAG 5433 



SEQUENCE COMPARISON US 6,607,879 Part of 09/990,437 

; Sequence 827, Application US/09023655
; Patent No. 6607879
; GENERAL INFORMATION:
; APPLICANT: Cocks, Benjamin G.
; APPLICANT: Susan G. Stuart
; APPLICANT: Jeffrey J. Seilhamer
; TITLE OF INVENTION: COMPOSITION FOR THE DETECTION OF BLOOD CELL GENE
; TITLE OF INVENTION: EXPRESSION
; NUMBER OF SEQUENCES: 1508
; INFORMATION FOR SEQ ID NO: 827:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 1595 base pairs
; TYPE: nucleic acid
; STRANDEDNESS: single
; TOPOLOGY: linear
; IMMEDIATE SOURCE:
; LIBRARY: LUNGAST01
; CLONE: 873034

US-09-023-655-827 



Query Match 38.7%; Score 1367.2; DB 4; Length 1595; 

Best Local Similarity 93.6%; Pred. No. 4.2e-304; 

Matches 1494; Conservative 0; Mismatches 10; Indels 92; Gaps 3; 



Qy 1807 GGAGAACCAGAACGGTTTTGACCTGGGCTGTCTCCAGCTGACCAACGAGAAGGTAGGCGA 1866
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 GGAGAACCAGAACGGTTTTGACCTGGGCTGTCTCCAGCTGACCAACGAGAAGGTAGGCGA 60

Qy 1867 TGTGGTGCTACCCCCGTGGGCCAGCTCTCCTGAGGACTTCATCCAGCAGCACCGCCAGGC 1926
        |||||||||||||||||||||||||||||||||||||||||||||||||||||| ||||
Db   61 TGTGGTGCTACCCCCGTGGGCCAGCTCTCCTGAGGACTTCATCCAGCAGCACCGGCAGGG 120

Qy 1927 TCTGGAGTCGGAGTATGTGTCTGCACACCTACACGAGTGGATCGACCTCATCTTTGGCTA 1986
        ||||||||||||||||||||||||||| ||||||||||||||||||||||||||||| ||
Db  121 TCTGGAGTCGGAGTATGTGTCTGCACANCTACACGAGTGGATCGACCTCATCTTTGGNTA 180

Qy 1987 CAAGCAGCGGGGGCCAGCCGCCGAGGAGGCCCTCAATGTCTTCTATTACTGCACCTATGA 2046
         |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 TAAGCAGCGGGGGCCAGCCGCCGAGGAGGCCCTCAATGTCTTCTATTACTGCACCTATGA 240

Qy 2047 GGGGGCTGTAGACCTGGACCATGTGACAGATGAGCGGGAACGGAAGGCTCTGGAGGGCAT 2106
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 GGGGGCTGTAGACCTGGACCATGTGACAGATGAGCGGGAACGGAAGGCTCTGGAGGGCAT 300

Qy 2107 TATCAGCAACTTTGGGCAGACTCCCTGTCAGCTGCTGAAGGAGCCACATCCAACTCGGCT 2166
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 TATCAGCAACTTTGGGCAGACTCCCTGTCAGCTGCTGAAGGAGCCACATCCAACTCGGCT 360

Qy 2167 CTCAGCTGAGGAAGCAGCCCATCGCCTTGCACGCCTGGACACTAACTCACCTAGCATCTT 2226
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 CTCAGCTGAGGAAGCAGCCCATCGCCTTGCACGCCTGGACACTAACTCACCTAGCATCTT 420

Qy 2227 CCAGCACCTGGACGAACTCAAGGCATTCTTCGCAGA------------------------ 2262
        ||||||||||||||||||||||||||||||||||||
Db  421 CCAGCACCTGGACGAACTCAAGGCATTCTTCGCAGAGGTTGTCAGTGATGGTGTACCCCT 480

Qy 2263 ------------------------------------------------------------ 2262

Db  481 GGTGCTAGCCCTGGTCCCCCACCGGCAGCCCCACTCCTTCATCACCCAGGGTTCCCCAGA 540

Qy 2263 ------GGTGACTGTGAGTGCCAGTGGGCTGCTGGGCACCCACAGCT-GGTTGCCCTATG 2315
              ||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db  541 CCTGTTGGTGACTGTGAGTGCCAGTGGGCTGCTGGGCACCCACAGCTGGGTTGCCCTATG 600

Qy 2316 ACCGCAACATAAGCAACTACTTCAGCTTCAGCAAAGACCCCACCATGGGCAGCCACAAGA 2375
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 ACCGCAACATAAGCAACTACTTCAGCTTCAGCAAAGACCCCACCATGGGCAGCCACAAGA 660

Qy 2376 CGCAGCGACTGCTGAGTGGCCCGTGGGTGCCAGGCAGTGGTGTGAGTGGACAAGCACTGG 2435
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 CGCAGCGACTGCTGAGTGGCCCGTGGGTGCCAGGCAGTGGTGTGAGTGGACAAGCACTGG 720

Qy 2436 CAGTGGCCCCGGATGGAAAGCTGCTATTCAGCGGTGGCCACTGGGATGGCAGCCTGCGGG 2495
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 CAGTGGCCCCGGATGGAAAGCTGCTATTCAGCGGTGGCCACTGGGATGGCAGCCTGCGGG 780

Qy 2496 TGACTGCACTACCCCGTGGCAAGCTGTTGAGCCAGCTCAGCTGCCACCTTGATGTAGTAA 2555
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 TGACTGCACTACCCCGTGGCAAGCTGTTGAGCCAGCTCAGCTGCCACCTTGATGTAGTAA 840

Qy 2556 CCTGCCTTGCACTGGACACCTGTGGCATCTACCTCATCTCAGGCTCCCGGGACACCACGT 2615
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 CCTGCCTTGCACTGGACACCTGTGGCATCTACCTCATCTCAGGCTCCCGGGACACCACGT 900

Qy 2616 GCATGGTGTGGCGGCTCCTGCATCAGGGTGGTCTGTCAGTAGGCCTGGCACCAAAGCCTG 2675
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 GCATGGTGTGGCGGCTCCTGCATCAGGGTGGTCTGTCAGTAGGCCTGGCACCAAAGCCTG 960

Qy 2676 TGCAGGTCCTGTATGGGCATGGGGCTGCAGTGAGCTGTGTGGCCATCAGCACTGAACTTG 2735
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 TGCAGGTCCTGTATGGGCATGGGGCTGCAGTGAGCTGTGTGGCCATCAGCACTGAACTTG 1020

Qy 2736 ACATGGCTGTGTCTGGATCTGAGGATGGAACTGTGATCATACACACTGTACGCCGCGGAC 2795
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 ACATGGCTGTGTCTGGATCTGAGGATGGAACTGTGATCATACACACTGTACGCCGCGGAC 1080

Qy 2796 AGTTTGTAGCGGCACTACGGCCTCTGGGTGCCACATTCCCTGGACCTATTTTCCACCTGG 2855
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 AGTTTGTAGCGGCACTACGGCCTCTGGGTGCCACATTCCCTGGACCTATTTTCCACCTGG 1140

Qy 2856 CATTGGGGTCCGAAGGCCAGATTGTGGTACAGAGCTCAGCGTGGGAACGTCCTGGGGCCC 2915
        |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db 1141 CATTGGGGTCCGAATGCCAGATTGTGGTACAGAGCTCAGCGTGGGAACGTCCTGGGGCCC 1200

Qy 2916 AGGTCACCTACTCCTTGCACCTGTATTCAGTCAATGGGAAGTTGCGGGCTTCACTGCCCC 2975
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 AGGTCACCTACTCCTTGCACCTGTATTCAGTCAATGGGAAGTTGCGGGCTTCACTGCCCC 1260

Qy 2976 TGGCAGAGCAGCCTACAGCCCTGACGGTGACAGAGGACTTTGTGTTGCTGGGCACCGCCC 3035
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 TGGCAGAGCAGCCTACAGCCCTGACGGTGACAGAGGACTTTGTGTTGCTGGGCACCGCCC 1320



Qy 3036 AGTGCGCCCTGCACATCCTCCAACTAAACACACTGCTCCCGGCCGCGCCTCCCTTGCCCA 3095
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 AGTGCGCCCTGCACATCCTCCAACTAAACACACTGCTCCCGGCCGCGCCTCCCTTGCCCA 1380

Qy 3096 TGAAGGTGGCCATCCGCAGCGTGGCCGTGACCAAGGAGCGCAGCCACGTGCTGGTGGGCC 3155
        |||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db 1381 TGAAGGTGGCCATCCGCAGCGTGGCCGTGACCAAGGAGCGCA-CCACGTGCTGGTGGGCC 1439

Qy 3156 TGGAGGATGGCAAGCTCATCGTGGTGGTCGCGGGGCAGCCCTCTGAGGTGCGCAGCAGCC 3215
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||    ||
Db 1440 TGGAGGATGGCAAGCTCATCGTGGTGGTCGCGGGGCAGCCCTCTGAGGTGCGCACAGCCC 1499

Qy 3216 AGTTCGCGCGGAAGCTGTGGCGGTCCTCGCGGCGCATCTCCCAGGTGTCCTCGGGAGAGA 3275
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1500 AGTTCGCGCGGAAGCTGTGGCGGTCCTCGCGGCGCATCTCCCAGGTGTCCTCGGGAGAGA 1559

Qy 3276 CGGAATACAACCCTACTGAGGCGCGCTGAACCTGGC 3311
        ||||||||||||||||||||||||||||||||||||
Db 1560 CGGAATACAACCCTACTGAGGCGCGCTGAACCTGGC 1595



Application/Control Number: 09/990,437 
Art Unit: 1646 
SEQUENCE COMPARISON GenBank AB011112 



AC   O60288;
DT   01-AUG-1998 (TrEMBLrel. 07, Created)
DT   01-OCT-2002 (TrEMBLrel. 22, Last sequence update)
DT   01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE   KIAA0540 protein (Fragment).
GN   KIAA0540.
OS   Homo sapiens (Human).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX   NCBI_TaxID=9606;
RN   [1]
RP   SEQUENCE FROM N.A.
RC   TISSUE=Brain;
RX   MEDLINE=98290545; PubMed=9628581;
RA   Nagase T., Ishikawa K., Miyajima N., Tanaka A., Kotani H., Nomura N.,
RA   Ohara O.;
RT   "Prediction of the coding sequences of unidentified human genes. IX.
RT   The complete sequences of 100 new cDNA clones from brain which can
RT   code for large proteins in vitro.";
RL   DNA Res. 5:31-39(1998).
DR   EMBL; AB011112; BAA25466.3; -.
DR   InterPro; IPR000409; Beige_BEACH.
DR   InterPro; IPR001680; WD40.
DR   Pfam; PF02138; Beach; 1.
DR   Pfam; PF00400; WD40; 2.
DR   ProDom; PD007848; Beige_BEACH; 1.
DR   SMART; SM00320; WD40; 3.
DR   PROSITE; PS50197; BEACH; 1.
DR   PROSITE; PS50082; WD_REPEATS_2; 2.
DR   PROSITE; PS50294; WD_REPEATS_REGION; 1.
KW   Repeat; WD repeat.
FT   NON_TER       1      1
SQ   SEQUENCE   2041 AA;  224474 MW;  CC523F1D5D041F51 CRC64;

Query Match 99.5%; Score 5222; DB 4; Length 2041; 

Best Local Similarity 97.1%; Pred. No. 0; 

Matches 1003; Conservative 0; Mismatches 0; Indels 30; Gaps 1; 
Qy    1 MSQFEMDTYAKSHDLMSGFWNACYDMLMSSGQRRQWERAQSRRAFQELVLEPAQRRARLE 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1009 MSQFEMDTYAKSHDLMSGFWNACYDMLMSSGQRRQWERAQSRRAFQELVLEPAQRRARLE 1068

Qy   61 GLRYTAVLKQQATQHSMALLHWGALWRQLASPCGAWALRDTPIPRWKLSSAETYSRMRLK 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1069 GLRYTAVLKQQATQHSMALLHWGALWRQLASPCGAWALRDTPIPRWKLSSAETYSRMRLK 1128

Qy  121 LVPNHHFDPHLEASALRDNLGEVPLTPTEEASLPLAVTKEAKVSTPPELLQEDQLGEDEL 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1129 LVPNHHFDPHLEASALRDNLGEVPLTPTEEASLPLAVTKEAKVSTPPELLQEDQLGEDEL 1188

Qy  181 AELETPMEAAELDEQREKLVLSAECQLVTVVAVVPGLLEVTTQNVYFYDGSTERVETEEG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1189 AELETPMEAAELDEQREKLVLSAECQLVTVVAVVPGLLEVTTQNVYFYDGSTERVETEEG 1248

Qy  241 IGYDFRRPLAQLREVHLRRFNLRRSALELFFIDQANYFLNFPCKVGTTPVSSPSQTPRPQ 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1249 IGYDFRRPLAQLREVHLRRFNLRRSALELFFIDQANYFLNFPCKVGTTPVSSPSQTPRPQ 1308

Qy  301 PGPIPPHTQVRNQVYSWLLRLRPPSQGYLSSRSPQEMLRASGLTQKWVQREISNFEYLMQ 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1309 PGPIPPHTQVRNQVYSWLLRLRPPSQGYLSSRSPQEMLRASGLTQKWVQREISNFEYLMQ 1368

Qy  361 LNTIAGRTYNDLSQYPVFPWVLQDYVSPTLDLSNPAVFRDLSKPIGVVNPKHAQLVREKY 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1369 LNTIAGRTYNDLSQYPVFPWVLQDYVSPTLDLSNPAVFRDLSKPIGVVNPKHAQLVREKY 1428

Qy  421 ESFEDPAGTIDKFHYGTHYSNAAGVMHYLIRVEPFTSLHVQLQSGRFDCSDRQFHSVAAA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1429 ESFEDPAGTIDKFHYGTHYSNAAGVMHYLIRVEPFTSLHVQLQSGRFDCSDRQFHSVAAA 1488

Qy  481 WQARLESPADVKELIPEFFYFPDFLENQNGFDLGCLQLTNEKVGDVVLPPWASSPEDFIQ 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1489 WQARLESPADVKELIPEFFYFPDFLENQNGFDLGCLQLTNEKVGDVVLPPWASSPEDFIQ 1548

Qy  541 QHRQALESEYVSAHLHEWIDLIFGYKQRGPAAEEALNVFYYCTYEGAVDLDHVTDERERK 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1549 QHRQALESEYVSAHLHEWIDLIFGYKQRGPAAEEALNVFYYCTYEGAVDLDHVTDERERK 1608

Qy  601 ALEGIISNFGQTPCQLLKEPHPTRLSAEEAAHRLARLDTNSPSIFQHLDELKAFFAE--- 657
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1609 ALEGIISNFGQTPCQLLKEPHPTRLSAEEAAHRLARLDTNSPSIFQHLDELKAFFAEVVS 1668

Qy  658 ---------------------------VTVSASGLLGTHSWLPYDRNISNYFSFSKDPTM 690
                                   |||||||||||||||||||||||||||||||||
Db 1669 DGVPLVLALVPHRQPHSFITQGSPDLLVTVSASGLLGTHSWLPYDRNISNYFSFSKDPTM 1728

Qy  691 GSHKTQRLLSGPWVPGSGVSGQALAVAPDGKLLFSGGHWDGSLRVTALPRGKLLSQLSCH 750
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1729 GSHKTQRLLSGPWVPGSGVSGQALAVAPDGKLLFSGGHWDGSLRVTALPRGKLLSQLSCH 1788

Qy  751 LDVVTCLALDTCGIYLISGSRDTTCMVWRLLHQGGLSVGLAPKPVQVLYGHGAAVSCVAI 810
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1789 LDVVTCLALDTCGIYLISGSRDTTCMVWRLLHQGGLSVGLAPKPVQVLYGHGAAVSCVAI 1848

Qy  811 STELDMAVSGSEDGTVIIHTVRRGQFVAALRPLGATFPGPIFHLALGSEGQIVVQSSAWE 870
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1849 STELDMAVSGSEDGTVIIHTVRRGQFVAALRPLGATFPGPIFHLALGSEGQIVVQSSAWE 1908

Qy  871 RPGAQVTYSLHLYSVNGKLRASLPLAEQPTALTVTEDFVLLGTAQCALHILQLNTLLPAA 930
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1909 RPGAQVTYSLHLYSVNGKLRASLPLAEQPTALTVTEDFVLLGTAQCALHILQLNTLLPAA 1968

Qy  931 PPLPMKVAIRSVAVTKERSHVLVGLEDGKLIVVVAGQPSEVRSSQFARKLWRSSRRISQV 990
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1969 PPLPMKVAIRSVAVTKERSHVLVGLEDGKLIVVVAGQPSEVRSSQFARKLWRSSRRISQV 2028

Qy  991 SSGETEYNPTEAR 1003
        |||||||||||||
Db 2029 SSGETEYNPTEAR 2041



LOCUS       AB011112                6364 bp    mRNA    linear   PRI 05-JUL-2002
DEFINITION  Homo sapiens mRNA for KIAA0540 protein, partial cds.
ACCESSION   AB011112
VERSION     AB011112.3  GI:20521076
KEYWORDS
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Nagase,T., Ishikawa,K., Miyajima,N., Tanaka,A., Kotani,H.,
            Nomura,N. and Ohara,O.
  TITLE     Prediction of the coding sequences of unidentified human genes. IX.
            The complete sequences of 100 new cDNA clones from brain which can
            code for large proteins in vitro
  JOURNAL   DNA Res. 5 (1), 31-39 (1998)
  MEDLINE   98290545
   PUBMED   9628581
REFERENCE   2  (bases 1 to 6364)
  AUTHORS   Ohara,O., Nagase,T. and Ishikawa,K.
  TITLE     Direct Submission
  JOURNAL   Submitted (13-FEB-1998) Osamu Ohara, Kazusa DNA Research Institute,
            Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba
            292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913,
            Fax:+81-438-52-3914)
COMMENT     On May 9, 2002 this sequence version replaced gi:6635198.
FEATURES             Location/Qualifiers
     source          1..6364
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="pg00116"
                     /sex="male"
                     /tissue_type="brain"
                     /clone_lib="pBluescriptII SK plus"
                     /note="This sequence was replaced that of hg04185 cDNA as
                     a representative cDNA sequence for KIAA0540."
     gene            1..6364
                     /gene="KIAA0540"
     CDS             <1..6128
                     /gene="KIAA0540"

                     /note="Start codon is not identified"
                     /codon_start=3
                     /product="KIAA0540 protein"
                     /protein_id="BAA25466.3"
                     /db_xref="GI:20521077"
                     /translation="PMELRHLLRPRPGLDSEPGGAEAGKARHAGAVIRTLSGMARHQG
                     PARALRYFDLTPSMAGIMVPPVQRWPGPGFTFHAWLCLHPMDTAPTPAPTRPLQRKQL
                     YSFFTSSGSGFEAFFTAAGTLWAVCTRKEYLTMSLPEVSFADSAWHCVAIVHVPGRR
                     PFSQNLVHVYKDGHLVKTAPLRCPSLSEPFSSCCIGSAGYRTTTTTTGLPTPPVPATL
                     AYTHPALTRSQSVPASTGLGWGSGLVAPLQEGSIDSTLAGTQDTRWGSPTSLEGELGA
                     VAIFHEALQATALRTLCTLGPNETAPFKPEGELHELSTRLLLHYSPQACKNNICLDLS
                     PSHGLDGRLTGHRVETWDVKDWNCVGGMGALLPLLERVAAQPKEAEAGPAETHDLVG
                     PELTSGHNTQGLVLPLGKSSEERMERNAVAAFLLMLRNFLQGHMVNQESLVQCQGPAI
                     IGALLRKVPSWAMDMNVLMSAQLLMEQVAAEGSGPLLYLLYQHLLFNFHLWTLSDFAV
                     RLGHIQYMSSIVREHRQKLRKKYGVQFILDALRTHYSPQRERPIAADDLRTVQTSLLG
                     LAREFLVRSLSADDVQVTQTMLSFLAATGDDGQILRRLQQNERLPERSRQRLRLRECG
                     LQGLVACLPEGTVSPQLCQGLYKLFLGAGSGNTAGGGGSSGTLTPASQPGTPSPLDGP
                     RPFPAAPGRHSSSLSNVLEDGSLPEPTISGDDTSNTSNPQQTSEEELCNLLTNVLFSV
                     TWRGVEGSDEAAWRERGQVFSVLTQLGASATLVRPPDCIKRSLLEMMLESALTDIKEA
                     PVGVLASLTQQALWLLRLLQDFLCAEGHGNQELWSEKLFEGVCSLLDRLGAWPHLANG
                     TADLREMAQIGLRLVLGYILLEDPQLHAQAYVRLHMLLQTAVPARREEACYVLSKLEA
                     ALGRVLNTSSLESATDEAGSPIAAAAAAAAAERCSWLVPLVRTLLDRAYEPLGLQWGL
                     PSLPPTNGSPTFFEDFQAFCATPEWRHFIDKQVQPTMSQFEMDTYAKSHDLMSGFWNA
                     CYDMLMSSGQRRQWERAQSRRAFQELVLEPAQRRARLEGLRYTAVLKQQATQHSMALL
                     HWGALWRQLASPCGAWALRDTPIPRWKLSSAETYSRMRLKLVPNHHFDPHLEASALRD
                     NLGEVPLTPTEEASLPLAVTKEAKVSTPPELLQEDQLGEDELAELETPMEAAELDEQR
                     EKLVLSAECQLVTVVAVVPGLLEVTTQNVYFYDGSTERVETEEGIGYDFRRPLAQLRE
                     VHLRRFNLRRSALELFFIDQANYFLNFPCKVGTTPVSSPSQTPRPQPGPIPPHTQVRN
                     QVYSWLLRLRPPSQGYLSSRSPQEMLRASGLTQKWVQREISNFEYLMQLNTIAGRTYN
                     DLSQYPVFPWVLQDYVSPTLDLSNPAVFRDLSKPIGVVNPKHAQLVREKYESFEDPAG
                     TIDKFHYGTHYSNAAGVMHYLIRVEPFTSLHVQLQSGRFDCSDRQFHSVAAAWQARLE
                     SPADVKELIPEFFYFPDFLENQNGFDLGCLQLTNEKVGDVVLPPWASSPEDFIQQHRQ
                     ALESEYVSAHLHEWIDLIFGYKQRGPAAEEALNVFYYCTYEGAVDLDHVTDERERKAL
                     EGIISNFGQTPCQLLKEPHPTRLSAEEAAHRLARLDTNSPSIFQHLDELKAFFAEVVS
                     DGVPLVLALVPHRQPHSFITQGSPDLLVTVSASGLLGTHSWLPYDRNISNYFSFSKDP
                     TMGSHKTQRLLSGPWVPGSGVSGQALAVAPDGKLLFSGGHWDGSLRVTALPRGKLLSQ
                     LSCHLDVVTCLALDTCGIYLISGSRDTTCMVWRLLHQGGLSVGLAPKPVQVLYGHGAA
                     VSCVAISTELDMAVSGSEDGTVIIHTVRRGQFVAALRPLGATFPGPIFHLALGSEGQI
                     VVQSSAWERPGAQVTYSLHLYSVNGKLRASLPLAEQPTALTVTEDFVLLGTAQCALHI
                     LQLNTLLPAAPPLPMKVAIRSVAVTKERSHVLVGLEDGKLIVVVAGQPSEVRSSQFAR
                     KLWRSSRRISQVSSGETEYNPTEAR"



ORIGIN 



Query Match 96.0%; Score 3391.2; DB 9; Length 6364; 

Best Local Similarity 97.4%; Pred. No. 0; 

Matches 3504; Conservative 0; Mismatches 3; Indels 91; Gaps 2; 

Qy   25 GCTGAACACCTCTTCTTTGGAGTCAGCCACTGATGAGGCAGGGTCCCCACTTGCAGCTGC 84
        ||||||||||||||| ||||||||||||||||||||||||||||||||||||||||||||
Db 2759 GCTGAACACCTCTTCCTTGGAGTCAGCCACTGATGAGGCAGGGTCCCCACTTGCAGCTGC 2818

Qy   85 AGCAGCTGCAGCAGCTGCAGAGCGCTGCTCCTGGCTGGTGCCACTGGTGCGCACGCTGCT 144
        ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 2819 AGCAGCTGCAGCAGCTGCAGAGCGCTGCTCCTGGCTGGTACCACTGGTGCGCACGCTGCT 2878

Qy  145 AGACCGTGCCTATGAGCCGCTGGGGCTGCAGTGGGGACTGCCCTCCCTGCCACCCACCAA 204
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2879 AGACCGTGCCTATGAGCCGCTGGGGCTGCAGTGGGGACTGCCCTCCCTGCCACCCACCAA 2938

Qy  205 TGGCAGCCCCACCTTCTTTGAAGACTTCCAGGCTTTTTGTGCCACACCCGAATGGCGCCA 264
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2939 TGGCAGCCCCACCTTCTTTGAAGACTTCCAGGCTTTTTGTGCCACACCCGAATGGCGCCA 2998

Qy  265 CTTCATCGACAAACAGGTACAGCCAACCATGTCCCAGTTCGAAATGGACACGTATGCTAA 324
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2999 CTTCATCGACAAACAGGTACAGCCAACCATGTCCCAGTTCGAAATGGACACGTATGCTAA 3058

Qy  325 GAGCCACGACCTTATGTCAGGTTTCTGGAATGCCTGCTATGACATGCTTATGAGCAGTGG 384
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3059 GAGCCACGACCTTATGTCAGGTTTCTGGAATGCCTGCTATGACATGCTTATGAGCAGTGG 3118

Qy  385 GCAGCGGCGCCAGTGGGAGCGCGCCCAGAGTCGTCGGGCCTTCCAGGAGCTGGTGCTGGA 444
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3119 GCAGCGGCGCCAGTGGGAGCGCGCCCAGAGTCGTCGGGCCTTCCAGGAGCTGGTGCTGGA 3178

Qy  445 ACCTGCGCAGAGGCGGGCGCGCCTGGAGGGGCTACGCTACACGGCAGTGCTGAAGCAGCA 504
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3179 ACCTGCGCAGAGGCGGGCGCGCCTGGAGGGGCTACGCTACACGGCAGTGCTGAAGCAGCA 3238

Qy  505 GGCAACGCAGCACTCCATGGCCCTGCTGCACTGGGGGGCGCTGTGGCGCCAGCTCGCCAG 564
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3239 GGCAACGCAGCACTCCATGGCCCTGCTGCACTGGGGGGCGCTGTGGCGCCAGCTCGCCAG 3298

Qy  565 CCCATGTGGGGCCTGGGCGCTGAGGGACACTCCCATCCCCCGCTGGAAACTGTCCAGCGC 624
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3299 CCCATGTGGGGCCTGGGCGCTGAGGGACACTCCCATCCCCCGCTGGAAACTGTCCAGCGC 3358

Qy  625 CGAGACATATTCACGCATGCGTCTGAAGCTGGTGCCCAACCATCACTTCGACCCTCACCT 684
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3359 CGAGACATATTCACGCATGCGTCTGAAGCTGGTGCCCAACCATCACTTCGACCCTCACCT 3418

Qy  685 GGAAGCCAGCGCTCTCCGAGACAATCTGGGTGAGGTTCCCCTGACACCCACCGAGGAGGC 744
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3419 GGAAGCCAGCGCTCTCCGAGACAATCTGGGTGAGGTTCCCCTGACACCCACCGAGGAGGC 3478

Qy  745 CTCACTGCCTCTGGCAGTGACCAAAGAGGCCAAAGTGAGCACCCCACCCGAGTTGCTGCA 804
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3479 CTCACTGCCTCTGGCAGTGACCAAAGAGGCCAAAGTGAGCACCCCACCCGAGTTGCTGCA 3538

Qy  805 GGAGGACCAGCTCGGCGAGGACGAGCTGGCTGAGCTGGAGACCCCGATGGAGGCAGCAGA 864
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3539 GGAGGACCAGCTCGGCGAGGACGAGCTGGCTGAGCTGGAGACCCCGATGGAGGCAGCAGA 3598

Qy  865 ACTGGATGAGCAGCGTGAGAAGCTGGTGCTGTCGGCCGAGTGCCAGCTGGTGACGGTAGT 924
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3599 ACTGGATGAGCAGCGTGAGAAGCTGGTGCTGTCGGCCGAGTGCCAGCTGGTGACGGTAGT 3658

Qy  925 GGCCGTGGTCCCAGGGCTGCTGGAGGTCACCACACAGAATGTATACTTCTACGATGGCAG 984
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3659 GGCCGTGGTCCCAGGGCTGCTGGAGGTCACCACACAGAATGTATACTTCTACGATGGCAG 3718

Qy  985 CACTGAGCGCGTGGAAACCGAGGAGGGCATCGGCTATGATTTCCGGCGCCCACTGGCCCA 1044
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3719 CACTGAGCGCGTGGAAACCGAGGAGGGCATCGGCTATGATTTCCGGCGCCCACTGGCCCA 3778

Qy 1045 GCTGCGTGAGGTCCACCTGCGGCGTTTCAACCTGCGCCGTTCAGCACTTGAGCTCTTCTT 1104
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3779 GCTGCGTGAGGTCCACCTGCGGCGTTTCAACCTGCGCCGTTCAGCACTTGAGCTCTTCTT 3838

Qy 1105 TATCGATCAGGCCAACTACTTCCTCAACTTCCCATGCAAGGTGGGCACGACCCCAGTCTC 1164
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3839 TATCGATCAGGCCAACTACTTCCTCAACTTCCCATGCAAGGTGGGCACGACCCCAGTCTC 3898

Qy 1165 ATCTCCTAGCCAGACTCCGAGACCCCAGCCTGGCCCCATCCCACCCCATACCCAGGTACG 1224
        |||||||||||||||||| |||||||||||||||||||||||||||||||||||||||||
Db 3899 ATCTCCTAGCCAGACTCCCAGACCCCAGCCTGGCCCCATCCCACCCCATACCCAGGTACG 3958

Qy 1225 GAACCAGGTGTACTCGTGGCTCCTGCGCCTACGGCCCCCCTCTCAAGGCTACCTAAGCAG 1284
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3959 GAACCAGGTGTACTCGTGGCTCCTGCGCCTACGGCCCCCCTCTCAAGGCTACCTAAGCAG 4018



Qy 1285 CCGCTCCCCCCAGGAGATGCTGCGTGCCTCAGGCCTTACCCAGAAATGGGTACAGCGTGA 1344
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4019 CCGCTCCCCCCAGGAGATGCTGCGTGCCTCAGGCCTTACCCAGAAATGGGTACAGCGTGA 4078

Qy 1345 GATATCCAACTTCGAGTACTTGATGCAACTCAACACCATTGCGGGGCGGACCTACAATGA 1404
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4079 GATATCCAACTTCGAGTACTTGATGCAACTCAACACCATTGCGGGGCGGACCTACAATGA 4138

Qy 1405 CCTGTCTCAGTACCCTGTGTTCCCCTGGGTCCTGCAGGACTACGTGTCCCCAACCCTGGA 1464
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4139 CCTGTCTCAGTACCCTGTGTTCCCCTGGGTCCTGCAGGACTACGTGTCCCCAACCCTGGA 4198

Qy 1465 CCTCAGCAACCCAGCCGTCTTCCGGGACCTGTCTAAGCCCATCGGTGTGGTGAACCCCAA 1524
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4199 CCTCAGCAACCCAGCCGTCTTCCGGGACCTGTCTAAGCCCATCGGTGTGGTGAACCCCAA 4258

Qy 1525 GCATGCCCAGCTCGTGAGGGAGAAGTATGAAAGCTTTGAGGACCCAGCAGGGACCATTGA 1584
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4259 GCATGCCCAGCTCGTGAGGGAGAAGTATGAAAGCTTTGAGGACCCAGCAGGGACCATTGA 4318

Qy 1585 CAAGTTCCACTATGGCACCCACTACTCCAATGCAGCAGGCGTGATGCACTACCTCATCCG 1644
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4319 CAAGTTCCACTATGGCACCCACTACTCCAATGCAGCAGGCGTGATGCACTACCTCATCCG 4378

Qy 1645 CGTGGAGCCCTTCACCTCCCTGCACGTCCAGCTGCAAAGTGGCCGCTTTGACTGCTCCGA 1704
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4379 CGTGGAGCCCTTCACCTCCCTGCACGTCCAGCTGCAAAGTGGCCGCTTTGACTGCTCCGA 4438

Qy 1705 CCGGCAGTTCCACTCGGTGGCGGCAGCCTGGCAGGCACGCCTGGAGAGCCCTGCCGATGT 1764
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4439 CCGGCAGTTCCACTCGGTGGCGGCAGCCTGGCAGGCACGCCTGGAGAGCCCTGCCGATGT 4498

Qy 1765 GAAGGAGCTCATCCCGGAATTCTTCTACTTTCCTGACTTCCTGGAGAACCAGAACGGTTT 1824
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4499 GAAGGAGCTCATCCCGGAATTCTTCTACTTTCCTGACTTCCTGGAGAACCAGAACGGTTT 4558

Qy 1825 TGACCTGGGCTGTCTCCAGCTGACCAACGAGAAGGTAGGCGATGTGGTGCTACCCCCGTG 1884
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4559 TGACCTGGGCTGTCTCCAGCTGACCAACGAGAAGGTAGGCGATGTGGTGCTACCCCCGTG 4618

Qy 1885 GGCCAGCTCTCCTGAGGACTTCATCCAGCAGCACCGCCAGGCTCTGGAGTCGGAGTATGT 1944
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4619 GGCCAGCTCTCCTGAGGACTTCATCCAGCAGCACCGCCAGGCTCTGGAGTCGGAGTATGT 4678

Qy 1945 GTCTGCACACCTACACGAGTGGATCGACCTCATCTTTGGCTACAAGCAGCGGGGGCCAGC 2004
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4679 GTCTGCACACCTACACGAGTGGATCGACCTCATCTTTGGCTACAAGCAGCGGGGGCCAGC 4738

Qy 2005 CGCCGAGGAGGCCCTCAATGTCTTCTATTACTGCACCTATGAGGGGGCTGTAGACCTGGA 2064
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4739 CGCCGAGGAGGCCCTCAATGTCTTCTATTACTGCACCTATGAGGGGGCTGTAGACCTGGA 4798

Qy 2065 CCATGTGACAGATGAGCGGGAACGGAAGGCTCTGGAGGGCATTATCAGCAACTTTGGGCA 2124
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4799 CCATGTGACAGATGAGCGGGAACGGAAGGCTCTGGAGGGCATTATCAGCAACTTTGGGCA 4858



Qy 2125 GACTCCCTGTCAGCTGCTGAAGGAGCCACATCCAACTCGGCTCTCAGCTGAGGAAGCAGC 2184
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4859 GACTCCCTGTCAGCTGCTGAAGGAGCCACATCCAACTCGGCTCTCAGCTGAGGAAGCAGC 4918

Qy 2185 CCATCGCCTTGCACGCCTGGACACTAACTCACCTAGCATCTTCCAGCACCTGGACGAACT 2244
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4919 CCATCGCCTTGCACGCCTGGACACTAACTCACCTAGCATCTTCCAGCACCTGGACGAACT 4978

Qy 2245 CAAGGCATTCTTCGCAGA------------------------------------------ 2262
        ||||||||||||||||||
Db 4979 CAAGGCATTCTTCGCAGAGGTTGTCAGTGATGGTGTACCCCTGGTGCTAGCCCTGGTCCC 5038

Qy 2263 ------------------------------------------------GGTGACTGTGAG 2274
                                                        ||||||||||||
Db 5039 CCACCGGCAGCCCCACTCCTTCATCACCCAGGGTTCCCCAGACCTGTTGGTGACTGTGAG 5098

Qy 2275 TGCCAGTGGGCTGCTGGGCACCCACAGCTGGTTGCCCTATGACCGCAACATAAGCAACTA 2334
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 5099 TGCCAGTGGGCTGCTGGGCACCCACAGCTGGTTGCCCTATGACCGCAACATAAGCAACTA 5158

Qy 2335 CTTCAGCTTCAGCAAAGACCCCACCATGGGCAGCCACAAGACGCAGCGACTGCTGAGTGG 2394
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 5159 CTTCAGCTTCAGCAAAGACCCCACCATGGGCAGCCACAAGACGCAGCGACTGCTGAGTGG 5218

Qy 2395 CCCGTGGGTGCCAGGCAGTGGTGTGAGTGGACAAGCACTGGCAGTGGCCCCGGATGGAAA 2454
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 5219 CCCGTGGGTGCCAGGCAGTGGTGTGAGTGGACAAGCACTGGCAGTGGCCCCGGATGGAAA 5278

Qy 2455 GCTGCTATTCAGCGGTGGCCACTGGGATGGCAGCCTGCGGGTGACTGCACTACCCCGTGG 2514
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 5279 GCTGCTATTCAGCGGTGGCCACTGGGATGGCAGCCTGCGGGTGACTGCACTACCCCGTGG 5338

Qy 2515 CAAGCTGTTGAGCCAGCTCAGCTGCCACCTTGATGTAGTAACCTGCCTTGCACTGGACAC 2574
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 5339 CAAGCTGTTGAGCCAGCTCAGCTGCCACCTTGATGTAGTAACCTGCCTTGCACTGGACAC 5398

Qy 2575 CTGTGGCATCTACCTCATCTCAGGCTCCCGGGACACCACGTGCATGGTGTGGCGGCTCCT 2634
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 5399 CTGTGGCATCTACCTCATCTCAGGCTCCCGGGACACCACGTGCATGGTGTGGCGGCTCCT 5458

Qy 2635 GCATCAGGGTGGTCTGTCAGTAGGCCTGGCACCAAAGCCTGTGCAGGTCCTGTATGGGCA 2694
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 5459 GCATCAGGGTGGTCTGTCAGTAGGCCTGGCACCAAAGCCTGTGCAGGTCCTGTATGGGCA 5518

Qy 2695 TGGGGCTGCAGTGAGCTGTGTGGCCATCAGCACTGAACTTGACATGGCTGTGTCTGGATC 2754
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 5519 TGGGGCTGCAGTGAGCTGTGTGGCCATCAGCACTGAACTTGACATGGCTGTGTCTGGATC 5578

Qy 2755 TGAGGATGGAACTGTGATCATACACACTGTACGCCGCGGACAGTTTGTAGCGGCACTACG 2814
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 5579 TGAGGATGGAACTGTGATCATACACACTGTACGCCGCGGACAGTTTGTAGCGGCACTACG 5638

Qy 2815 GCCTCTGGGTGCCACATTCCCTGGACCTATTTTCCACCTGGCATTGGGGTCCGAAGGCCA 2874
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 5639 GCCTCTGGGTGCCACATTCCCTGGACCTATTTTCCACCTGGCATTGGGGTCCGAAGGCCA 5698

Qy 2875 GATTGTGGTACAGAGCTCAGCGTGGGAACGTCCTGGGGCCCAGGTCACCTACTCCTTGCA 2934
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 5699 GATTGTGGTACAGAGCTCAGCGTGGGAACGTCCTGGGGCCCAGGTCACCTACTCCTTGCA 5758

Qy 2935 CCTGTATTCAGTCAATGGGAAGTTGCGGGCTTCACTGCCCCTGGCAGAGCAGCCTACAGC 2994
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 5759 CCTGTATTCAGTCAATGGGAAGTTGCGGGCTTCACTGCCCCTGGCAGAGCAGCCTACAGC 5818

Qy 2995 CCTGACGGTGACAGAGGACTTTGTGTTGCTGGGCACCGCCCAGTGCGCCCTGCACATCCT 3054
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 5819 CCTGACGGTGACAGAGGACTTTGTGTTGCTGGGCACCGCCCAGTGCGCCCTGCACATCCT 5878

Qy 3055 CCAACTAAACACACTGCTCCCGGCCGCGCCTCCCTTGCCCATGAAGGTGGCCATCCGCAG 3114
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 5879 CCAACTAAACACACTGCTCCCGGCCGCGCCTCCCTTGCCCATGAAGGTGGCCATCCGCAG 5938

Qy 3115 CGTGGCCGTGACCAAGGAGCGCAGCCACGTGCTGGTGGGCCTGGAGGATGGCAAGCTCAT 3174
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 5939 CGTGGCCGTGACCAAGGAGCGCAGCCACGTGCTGGTGGGCCTGGAGGATGGCAAGCTCAT 5998

Qy 3175 CGTGGTGGTCGCGGGGCAGCCCTCTGAGGTGCGCAGCAGCCAGTTCGCGCGGAAGCTGTG 3234
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 5999 CGTGGTGGTCGCGGGGCAGCCCTCTGAGGTGCGCAGCAGCCAGTTCGCGCGGAAGCTGTG 6058

Qy 3235 GCGGTCCTCGCGGCGCATCTCCCAGGTGTCCTCGGGAGAGACGGAATACAACCCTACTGA 3294
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 6059 GCGGTCCTCGCGGCGCATCTCCCAGGTGTCCTCGGGAGAGACGGAATACAACCCTACTGA 6118

Qy 3295 GGCGCGCTGAACCTGGCCAGTCCGGCTGCTCGGGCCCCGCCCCCGGCAGGCCTGGCCCGG 3354
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 6119 GGCGCGCTGAACCTGGCCAGTCCGGCTGCTCGGGCCCCGCCCCCGGCAGGCCTGGCCCGG 6178

Qy 3355 GAGGCCCCGCCCAGAAGTCGGCGGGAACACCCCGGGGTGGGCAGCCCA-GGGGGTGAGCG 3413
        |||||||||||||||||||||||||||||||||||||||||||||||| |||||||||||
Db 6179 GAGGCCCCGCCCAGAAGTCGGCGGGAACACCCCGGGGTGGGCAGCCCAGGGGGGTGAGCG 6238

Qy 3414 GGGCCCACCCTGCCCAGCTCAGGGATTGGCGGGCGATGTTACCCCCTCAGGGATTGGCGG 3473
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 6239 GGGCCCACCCTGCCCAGCTCAGGGATTGGCGGGCGATGTTACCCCCTCAGGGATTGGCGG 6298

Qy 3474 GCGGAAGTCCCGCCCCTCGCCGGCTGAGGGGCCGCCCTGAGGGCCAGCACTGGCGTCT 3531
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 6299 GCGGAAGTCCCGCCCCTCGCCGGCTGAGGGGCCGCCCTGAGGGCCAGCACTGGCGTCT 6356



